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Abstract 

We study the origin of quantum probabilities as arising from non-boolean propositional- 
opcrational structures. We apply the method developed by Cox to non distributive lattices 
and deduce non-kolmogorvian probability measures of quantum mechanics. We outline a 
general framework for deducing probabilities for general propositional lattices generalizing 
the method developed by Cox. 
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1 Introduction 

Quantum probabilities have been an intriguing question since the very beginning of quantum 
theory. It was rapidly realized that probability amplitudes of quantum process obeyed different 
rules, as for example, the sum rule of probability amplitudes giving rise to interference terms 
or the nonexistence of joint distributions for noncommuting observables. In 1936 von Neumann 
wrote the first work ever to introduce quantum logics PJ, proving that quantum mechanics 
requires a propositional calculus substantially different from all classical logics. In the paper, 
he rigorously isolated a new algebraic structure for quantum logics. The concept of creating 
a propositional calculus for quantum logic had been already outlined by him in 1932. But in 
1936 the necessity of devising a new propositional calculus was demonstrated through several 
proofs. For example, photons cannot pass through two successive filters which are polarized 
perpendicularly (e.g. one horizontally and the other vertically), and therefore, a fortiori, it 
cannot pass if a third filter polarized diagonally is added to the other two, either before or after 
them in the succession. However, if the third filter is added in-between the other two, the photons 
do pass through. Such experimental fact is translatable into logic as a non-commutativity of 
conjunction and lies beyond classical considerations. 

Quantum and classical probabilities have points in common as well as differences. These differ- 
ences has been largely studied in the literature [21 [3j SI El d El [9] - 
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There exi st t wo important axiomatizations of probabilities. One of them was provided by Kol- 
mogorov |10j . a set theoretical approach based on boolean sigma algebras of a sample space. 
Probabilities are defined as measures over subsets of a given set. Thus, the Kolmogorovian 
approach is set theoretical and usually identified (but not necessarily) with a frequentistic in- 
terpretation of probabilities. It was then realized that quantum probabilities can be formulated 
as measures over non boolean structures (instead of boolean sigma algebras), therefore the 
term non-boolean or non-kolmogorovian probabilities [6]. It is remarkable that the creation of 
quantum theory and the works on the foundations of probability by Kolmogorov where both 
developed at the same time in the twenties. 

An alternative approach to the Kolmogorovian construction of probabilities was developed by 
R. T. Cox I12j. Cox starts with a propositional calculus, intended to represent assertions 
which represent our kno wle dge about the world or system under investigation. As is well known 
since the work of Boole [13], propositions of classical logic (CL) can be represented as a Boolean 
lattice, i.e., an algebraic structure endowed with lattice operations "A", "V" and "->", which 
are intended to represent conjunction, disjunction and negation respectively, and a partial order 
relation "<" which is intended to represent logical implication. Boolean lattices (as seen form an 
algebraic point of view) can be characterized by axioms [14^ 115 1 [T6], By considering probabilities 
as an inferential calculus on a boolean lattice, Cox showed that the axioms of probability can be 
deduced as a consequence of lattice symmetries, as well as entropy as a measure of information. 
Thus, differently form the set theoretical approach of Kolmogorov, the approach by Cox considers 
probabilities as an inferential calculus, and because of its natural links with entropy as a measure 
of information, is directly linked to Jaynes's MaxEnt principle |17t 118], 

It was recently shown that Feymann's rules of quantum mechanics can be deduced from boolean 
structures using a variant of Cox method [HI GOJ EH [22j [23] (see also [EICE])- This is done by: 

• first defining an operational propositional calculus on a quantum system under study 
(which results a distributive lattice), and after that, 

• postulating that any quantum process (interpreted as a proposition in the operational 
propositional calculus) can be represented by a pair of real numbers and, 

• using a variant of the method developed by Cox, showing that these pairs of real numbers 
obey the sum and product rules of complex numbers, and can then be interpreted as the 
quantum probability amplitudes which appear in Feymann's rules. 

There is a long tradition with regards to the application of lattice theory to physics. The 
quantum logical (QL) approach to quantum theory (and physics in general), initiated by von 
Neumann in [23] , has been a traditional tool for studies on the foundations of quantum mechanics 
(see for example [SilliaEaililEHlEOlE^ complete bibliography 

[3], [36], and [37]). 

The (QL) approach to physics bases itself on defining elementary tests and propositions for 
quantum systems and then, studying the structure of these propositional structures. This is done 
in an operational way \27\ [38j [4*01 [4T| [4"2] , and is susceptible of considerable generalization 
to arbitrary physical systems (not necessarily quantum ones). That is why the approach is 
also called operational quantum logic (OQL). One of the most important goals of OQL is to 
impose operationally motivated axioms on a lattice structure in order that it can be made 
isomorphic to a projection lattice on a Hilbert space. There are different positions in the 
literature about the question of whether this goal has been achieved or not, and also, of course, 
alternative operational approaches to physics, as the convex operational one [?, ?, ?,[2]. The 
operational approach presented in [2] bases itself only in the convex formulation of any statistical 
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theory, and it can be shown that the more general structure which appears under reasonable 
operational considerations is an a-orthocomplemented orthomodular poset, a more general class 
than orthomodular lattices (the ones which appear in quantum theory). We will come back to 

these issues and review the definitions for these structures below. 

In this work we ask the following questions: 

• what happens if the Cox's method mentioned above is applied to general lattices (rep- 
resenting general, not necessarily distributive, physical systems)? 

• does the logical underlying structure of the theory determine the form and properties 
of the probabilities? 

• is it possible to generalize Cox's method to arbitrary lattices? 

As we shall see below, it is possible to use these questions to derive quantum probabilities. 
We will show that once the operational structure of the theory is determined, the general 
properties of probability theory are determined. We also discuss the implications of this 
derivation for the foundations of quantum phy sics and probability theory, and compare with 
different approaches: the one presented in [21], the OQL approach, the operational approach 
of [2], and the traditional one (represented by the von Neumann formalism of Hilbertian 
quantum mechanics |44| ) . 

The approach presented here shows itself as susceptible of great generalization: we provide an 
algorithm for developing generalized probabilities using a combination of the Cox's method 
with the OQL approach. This opens the door to the development of more general probability 
and information measures. This approach exhibits the advantage that, in the particular case 
of quantum mechanics, it include s mi xed s tate s, unlike other approaches based only on pure 
states (like the ones presented in [23] and fl"9]). 

The paper is organized as follows. In Section[2]we review the OQL approach to physics as well as 
lattice theory. In Section [3] we review Kolmogorov's and Cox' approaches to probability. After 
that, in Section HI we review quantum probabilitie s and th ei r d iff eren ces w ith classical ones. 
In Section [5] we discuss the approach developed in [19], [20], [21], [22j . and [23]. In Section [6] 
we apply Cox's method for deriving non-Kolmogorovian probabilities from non-boolean lattices 
and study several examples. Finally, in section [8] some conclusions are drawn. 

2 The lattice/operational approach to physics 

The quantum logical approach to physics is vast and includes different programs. We will 
concentrate on the path followed by von Neumann and the operational approach developed by 
Jauch, Piron, and others. First, we recall the relationship between projection operators and 
elementary tests i n Q M. After studying the examples of lattices applied to QM and CM, as 
first introduced in [23], we review the main features of the QL approach. 

2.1 Elementary notions of lattice theory 

A partially ordered set (also called a poset) is a set X endowed with a partial ordering relation 
"<" satisfying 

• 1- For all x, y E X, x < y and y < x entails x = y 

• 2- For all x,y,z£X,iix<y and y < z, then x < z 
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The notation u x < y" is used to denote u x < y" or a x = y" . A lattice C will be a poset in which 
any two elements a and b have a unique supremum (the elements' least upper bound "a V 6" ; 
called their join) and an infimum (greatest lower bound "a A 6"; called their meet). Lattices can 
also be characterized as algebraic structures satisfying certain axiomatic identities imposed on 
operations "V" and "A" . For a complete lattice all its subsets have both a supremum (join) and 
an infimum (meet). 

A bounded lattice has a greatest (or maximum) and least (or minimum) element, denoted 1 and 
by convention (also called top and bottom, respectively). Any lattice can be converted into 
a bounded lattice by adding a greatest and least element, and every non-empty finite lattice 
is bounded. For any set A, the collection of all subsets of A (called the power set of A) can 
be ordered via subset inclusion to obtain a lattice bounded by A itself and the null set. Set 
intersection and union represent the operations meet and join, respectively. 

Every complete lattice is a bounded lattice. While bounded lattice homomorphisms in general 
preserve only finite joins and meets, complete lattice homomorphisms are required to preserve 
arbitrary joins and meets. If P is a bounded poset, an orthocomplementation in P is a bounded 
lattice in which there exists a unary operation . .)" such that: 

-Ha))) = a (la) 

a < b — >^b<^a (lb) 

aV-ia and a A-ia exist and both 

a V —>a = 1 (lc) 

aAn fl = (Id) 

hold. A bounded poset with ortocomplementation will be called an orthoposet. An ortholattice, 
will be an orthoposet which is also a lattice. For a, b € C (an ortholattice or orthoposet), we 
say that a is orthogonal to b (a_L6) iff a < -*b. 

Distributive lattices are lattices for which the operations of join and meet are distributed over 

each other. A complete complemented lattice that is also distributive is a Boolean algebra. For 

a distributive lattice, the complement of x, when it exists, is unique. The prototypical examples 

of Boolean algebras are collections of sets for which the lattice operations can be given by set 

union and intersection, and lattice complementation by set theoretical complementation. 

A modular lattice is one that satisfies the following self-dual condition (modular law or modular 

identity) 

x < b — > x V (a A b) = (x V a) A b (2) 

Modular lattices arise naturally in algebra and in many other areas of mathematics. For example, 
the subspaces of a finite dimensional vector space form a modular lattice. Every distributive 
lattice is modular. In a not necessarily modular lattice, there may still be elements b for which 
the modular law holds in connection with arbitrary elements a and x (< b). Such an element 
is called a modular element. Even more generally, the modular law may hold for a fixed pair 
(a,b). Such a pair is called a modular pair, and there are various generalizations of modularity 
related to this notion and to semi-modularity. 

An orthomodular lattice will be an ortholattice satisfying the orthomodular law: 

x < b — > x V (-.x A b) = b (3) 
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For example, the lattice C(H) of closed subspaces of a Hilbert space H is orthomodular. C{H) is 
modular iff H is finite dimensional. In addition, if we give the set T > P {H) of (bounded) projection 
operators on H an ordering structure by defining P < Q iff V(H) < Q(H), then Vp{H) is lattice 
isomorphic to C{H), and hence orthomodular [24j . 

The concept of lattice's atom is of great physical importance. If C has a null element 0, then an 
element x of C is an atom if < x and there exists no element y of C such that < y < x. One 
says that C is: 

i) Atomic, if for every nonzero element x of £, there exists an atom a of £ such that a = x 

ii) Atomistic, if every element of £ is a supremum of atoms. 

2.2 Elementary measurements and projection operators 

In QM, an elementary measurement given by a yes-no experiment (i.e., a test in which we get 
the answer "yes" or the answer "no"), is represented by a projection operator. If R is the real 
line, let B(M) be the family of subsets of R such that 

• 1 - The family is closed under set theoretical complements. 

• 2 - The family is closed under denumerable unions. 

• 3 - The family includes all open intervals. 

The elements of B(M) are the Borel subsets of M [45] . In QM, a projection valued measure 
(PVM) M, is a mapping 

M : B(1Z) -> V(U) (4a) 

such that 

M(0) = (4b) 
M(R) = 1 (4c) 

M(U j (B j )) = Y,M(B j ), (4d) 

i 

for any disjoint denumerable family Bj. Also, 

M(B C ) = 1 - M(.B) = (M(B)) 1 - (4e) 

All operators representing observables can be expressed in terms of PVM's (and so, reduced to 
sets of elementary measurements), via the spectral decomposition theorem, which asserts that 
the set of spectral meas ure ments may be put in a bijective correspondence with the set A of self 
adjoint operators of H [45j . 

Any quantum system represented by a separable Hilbert space % has associated a lattice, formed 
by all its closed subspaces C v j^{T-i) =< V(H), H, ©, -i, 0, 1 >, where is the empty set 0, 
1 is the total space H, © the closure of the sum, and _, (5') is the orthogonal complement of 
a subspace S [5]. Closed subspaces can be put in one to one correspondence with projection 
operators. Thus, elementary tests in QM, which are represented by projection operators, can be 
endowed with a lattice structure. This lattice was called "Quantum Logic" by Birkhoff and von 
Neumann [53] . We will refer to this lattice as the von Neumann-lattice (C v j\f(7i)) [5]. 
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The analogous of this structure in Classical Mechanics (CM) was provided by Birkoff and von 
Neumann [23]. Take for example the following operational propositions on a classical harmonic 
oscillator: "the energy is equal to Eq" and "the energy is lesser or equal than Eq" . The first 
one corresponds to an ellipse in phase space, and the second to the ellipse and its interior. This 
simple example shows that operational propositions in CM can be represented by subsets of the 
phase space. Thus, given a classical system S with phase space V, let V(T) represent the set 
formed by all the subsets of V. Then, the set of operational propositions can be represented by 
the elements of V(T). Thus, ^(r ), as well as ViT-L), can be structured as a lattice of propositions. 
As we have seen in Section 12.21 the operational propositions of a classical systems S can be 
identified with the set of subsets V(T) of its phase space T. This set can be enbdowed with a 
lattice structure as follows. If "V" is represented by set union, "A" by set intersection, "-i" by 
set complement (with respect to V), and and 1 are represented by and T respectively, then 
< V(F), PI, ©, -i, 0, 1 > forms a complete bounded lattice. This is the lattice of propositions 
of a classical system, which as is well known, is a boolean one. 

2.3 The Quantum Logical Approach to Physics 

We have seen that operational propositions of quantum and classical systems can be endowed 
with lattice structures. These lattices where boolean f or c lassical systems, and non boolean for 
quantum ones. This fact, discovered by von Neumann |24| . raised a lot of interesting questions. 
The first one is: is it possible to obtain QM (as well as CM) by imposing suitable axioms 
on a lattice structure? The surprising answer is yes, it is possible. But the road which led to 
this result was fairly difficult and full of obstacles. In the first place, it was a very difficult 
mathematical task to demonstrate that a suitably chosen set of axioms on a lattice would yield 
a representation theorem which would allow one to recover Hilbertian QM. The first result was 
obtained by Piron, and the final demonstration was given by Soler in 1995 [46J (see also [3], 
page 72). One of the main advantages of this approach is that the axioms imposed on a lattice 
structure could be given a clear operational interpretation: unlike the Hilbert space formulation, 
whose axioms have the disadvantage of being ad hoc and physically unmotivated, the quantum 
logical approach is clearer and more intuitive from a physical point of view. 

The second important question raised by the von Neumann discovery was: given that QM and 
CM can be described by operational lattices, is it possible to formulate the entire apparatus 
of physics in lattice theory terms? Given any physical system, quantum, classical, or obeying 
more general tenets, it is always possible to define an operational propositional structure on it 
using the notion of elementary tests. A very general approach to physics can be given using 
event structures, which are sets of events endowed with a probability measures satisfying certain 
axioms [2J. It can be shown (see [2], Chapter 3) that any event structure is isomorphic to a 
a-orthocomplete orthomodular poset, which is an orthocomplemented poset V, satisfying the 
orthomodular identity ([3]), and for which if a, £ V and ai-Laj (i ^ j), implies that \J ai exists 
for i = 1,2,.... Remark that event structures (or cr-orthocomplete orthomodular posets) need 
not to be lattices. However, lattices are very general structures and encompass most important 
examples. Consequently, we will work with orthomodular lattices in this paper (and indicate 
which results can be easily extended to cr-orthocomplete orthomodular posets). 
There are other general approaches to statistical theories, one of them is the convex operational 
one I48 t l4"9" l l5l)j. which consists on imposing axioms on a convex structure (formed by physical 
states). Indeed, the convex operational approach is even more general than the quantum logical, 
but we will not discuss in detail this issue here. 
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3 Cox vs. Kolmogorov 



In this Section we will review two different approaches to probability theory. On one hand, the 
Cox's approach, in which probabilities are considered as measures of the plausibility of a given 
event or happenings. On the other hand, the traditional Kolmogorovian one, a set theoretical 
approach which is compatible with the interpretation of probabilities as frequencies. 



3.1 Kolmogorov 

Given a set Q, let us consider a cr-algebra E of £1. Then, a probability measure will be given by 
a function \i such that 

fj, : E [0, 1] (5a) 

which satisfies 

M(0) = (5b) 
[x(A c ) = l-v(A), (5c) 

where (. . .) c means set-theoretical-complement and for any pairwise disjoint denumerable family 
{A}iei 

m(U^)=E^) ( 5d ) 

iel i 

where conditions © are the well known axioms of Kolmogorov. The triad (O, E, fj,) is called 
a probability space. Depending on the context, probability spaces obeying Eqs. ^) are usually 
referred as Kolmogorovian, classical, commutative or boolean probabilities [2]. 
It is possible to show that if (fi, E, /i) is a kolmogorovian probability space, the inclusion- 
exclusion principle holds 

H(A U B) = p(A) + p(B) - fj,(A n B) (6) 

As remarked in |51| . Eq. ([6]) was considered as crucial by von Neumann for the interpretation 

of fi(A) and fi(B) as relative frequencies. If iV^^ym, ^ix(A)^ -^fi(B) > ^fi(AnB) are the number of 

times of each event to occur in a series of N repetitions, then ([6]) trivially holds. 

This principle does no longer hold in QM, and this is linked to the non-boolean character of 

this theory. Thus, the relative frequencies interpretation of quantum probabilities is problematic 

here. 



3.2 Cox's approach 

Propositions of classical logic can be endowed with a Boolean lattice structure. The logical 
implication " — is associated with a partial order relation "<", the conjunction "and" with 
the greatest lower bound "A", disjunction "or" with the lowest upper bound "V", and negation 
"not" is associated with complement . A boolean lattice is a structure satisfying the following 
axioms: 

• LI. xVi = i, x Ax = x (idempotence) 

• L2. x V y = y V x, x A y = y A x (commutativity) 

• L3. x V (y V z) = (x V y) V z, x A (y A z) = (x A y) A z (associativity) 
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• L4. x V (x A y) = x A (x V y) = x (absortion) 

• Dl. x A (y V z) = (x A y) V (x A z) (distributivity 1) 

• D2. x V (y A 2f) = (x V y) A (x V 2) (distributivity 2) 

As is well known, boolean lattices can be represented as subsets of a given set, with "<" repre- 
sented as set theoretical inclusion C, "V" represented as set theoretical union "U", "A" repre- 
sented as set intersection "n", -i represented as the set theoretical complement "(. . .) c ". 

As a typical feature, Cox derives classical probability theory as an inferential calculus on boolean 
lattices. A real valued function ip representing the degree to which a proposition y implies 
another proposition x is postulated, and its properties deduced from the algebraic properties of 
the boolean lattice. It turns out that (p(x\y) -if suitably normalized- satisfies all the properties 
of a Kolmogorovian probability (Eqs. ([5])). The deduction will be omitted here, and the reader 
is referred to [T2], (TTJ [HI [151 120] for detailed expositions. 

Despite their formal equivalence, there is a great conceptual difference between the approaches 
of Kolmogorov and Cox. In the Kolmogorovian approach probabilities are naturally interpreted 
as relative frequencies in a sample space. On the other hand, the approach developed by Cox, 
considers probabilities as a measure of the degree of belief of an intelligent agent (which may be 
a machine), on the truth of proposition x if it is known that y is true. This measure is given by 
the real number cp(x\y), and in this way the Cox's approach is more compatible with a Bayesian 
interpretation of probability theory. 



4 Quantum vs. classical probabilities 

In this Section we will introduce quantum probabilities and look at their differences with classical 
ones. Great part of the hardship faced by Birkhoff and von Neumann in developing the logic of 
quantum mechanics were due to the inadequacies of classical probability theory. Their point of 
view was that a statistical physical theory could be regarded as a probability theory, founded 
on a calculus of events. These events should be the experimentally verifiable propositions of 
the theory, and the structure of this calculus was to be deduced from empirical considerations, 
which for the quantum case, resulted in an orthomodular lattice |24} 19]. 

In the formulation of both classical and quan tum probabilities, states can be regarded as rep- 
resenting consistent probability assignments |47t [TJ. In the quantum mechanics instance this 
"states as mappings" visualization is achieved via postulating a function [5] 



such that: 



s : V{%) -> [0; 1] 

s(0) = (0 is the null subspace). 
s(P ± ) = l-s(P), 



(7a) 

(7b) 
(7c) 



and, for a denumerable and pairwise orthogonal family of projections 
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Gleason's theorem [521 153j . tell us that if the dimension of Ti > 3, any measure s satisfying ([7|) 
can be put in correspondence with a trace class operator (of trace one) p s via the correspondence: 



S 



s(P) := tr(p s P) 



(8) 



And vice versa: using equation (JSj) any trace class operator of trace one defines a measure as 
in ([7]). Thus, equations define a probability: to any elementary test (or event), represented 
by a projection operator P, s(P) gives us the probability that the event P occurs, and this 
is experimentally granted by the validity of Born's rule. But in fact, ([7]) is not a classical 
probability, because it does not obeys Kolmogorov's axioms ©. The main difference comes from 
the fact that the cr-algebra in Q is boolean, while V(Ti) is not. Thus, quantum probabilities are 
also called non-kolmogorovian (or non-boolean) probability measures. The crucial fact is that, 
in the quantum case, we do not have a a-algebra, but an orthomodular lattice of projections. 
One of the most important ensuing differences expresses itself in the fact that Eq. (j^J) is no 
longer valid in QM. Indeed, it may happen that 



for A and B suitably chosen elementary sharp tests. Another important difference comes from 
the difficulties which appear when one tries to define a quantum conditional probability (see for 
example j2] and [6] for a comparison between classical and quantum probabilities). Quantum 
probabilities may also be considered as a generalization of classical probability theory: while in 
an arbitrary statistical theory a state will be a normalized measure over a suitable C*-algebra, 
the classical case is recovered when the algebra is commutative [2]. 

We are thus faced with the following fact: on the one hand, there exists a generalization of 
classical probability theory to non-boolean operational structures. On the other hand, Cox 
derives classical probabilities from the algebraic properties of classical logic. As we shall see in 
detail below, this readily implies that probabilities in CM are determined by the operational 
structure of classical propositions (given by subsets of phase space). The question is: is it possible 
to generalize Cox's method to arbitrary propositional structures (representing the operational 
propositions of an arbitrary theory) even when they are not boolean? What would we expect to 
find? We will see that the answer to the first question is yes, and for the second, it is reasonable 
to recover quantum probabilities (Eq. ©). This approach may serve as a solution for a problem 
posed by von Neumann. In his words: 

"In order to have probability all you need is a concept of all angles, I mean, other than 
90. Now it is perfectly quite true that in geometry, as soon as you can define the right 
angle, you can define all angles. Another way to put it is that if you take the case of 
an orthogonal space, those mappings of this space on itself, which leave orthogonality 
intact, lives all angles intact, in other words, in those systems which can be used as 
models of the logical background for quantum theory, it is true that as soon as 
all the ordinary concepts of logic are fixed under some isomorphic transformation, 
all of probability theory is already fixed... This means however, that one has a 
formal mechanism in which, logics and probability theory arise simultaneously and 
are derived simultaneously. [5T] " 

and, as remarked by M. Redei |51j : 

"It was simultaneous emergence and mutual determination of probability and logic 
what von Neumann found intriguing and not at all well understood. He very much 
wanted to have a detailed axiomatic study of this phenomenon because he hoped that 
it would shed "... a great deal of new light on logics and probability alter the whole 
formal structure of logics considerably, if one succeeds in deriving this system from 
first principles, in other words from a suitable set of axioms." (quote) He emphasized 




(9) 
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-and this was his last thought in his address- that it was an entirely open problem 
whether/how such an axiomatic derivation can be carried out." 

The problem posed above has remained thus far unanswered, and this work may be considered as 
concrete step towards its solution. Before entering the subject, let us first review an alternative 
approach. 



5 Alternative derivation of Feymann's rules 

Refs. [21], [22], and |23] present a novel derivation of Feyman's rules for quantum mechanics, 
based on a modern reformulation [20j of Cox's ideas on the foundations of probability |11[ 112). 
as follows: First, an experimental logic of processes is defined for quantum systems. This is done 
in such a way that the resulting algebra is a boolean one. Given a sequence of n measurements 
Mi,. . .,M n on a given system, with results mi, rri2, m n , the later are organized in the 
proposition A = [mi,m2, . . . , m n , as a particular process. 

If each of the m^'s has two possible values, 1 and 2, a possible proposition of three measurements 
is for example A\ = [1, 2, 1]. Another one could be Ai = [1, 1, 2] and so on. If we want to "coarse 
grain" a certain measurement, say M2, and forget about its particular result, we can unite the 

two outcomes in a joint outcome (1,2), yielding the experiment (measurement) M2. Thus, a 

possible sequence obtained by the replacement of M2 by M2 could be [1, (1, 2), 1]. This is used 
to define a logical operation 

[mi, . . . , (m;, rr4), . . . ,m n ] = [mi, . . . ,m if . . .,m n ] V [mi, . . . ,m-, . . . ,m n ] (10) 

It is intended that sequences of measurements can be compounded. For example, if we have 
[mi, 7712] and ^2,1713], we have also the sequence [mi, 7712,7713], paving the way for the general 
definition 

[mi, ...,rrij,... ,m n ] = [m 1 , ...,mj]- [mj, . . . ,m n ] (11) 
Thus defined, these operations satisfy 

A V B = B V A (12a) 



(A V B) V C = A V {B V C) (12b) 
(A ■ B) ■ C = A ■ (B ■ C) (12c) 
(AVB)-C = (A-C)V(B-C) (12d) 

C ■ (Ay B) = (C ■ A)y (C ■ B), (12e) 

yielding a boolean algebra. Remark that the logic derived above is empirical (operational). 
Thus, its main features depend on contingent facts which have to do with i) the characteristics 
of the system under study and ii) the definitions made by the experimenter. Accordingly, the 
fact that the algebra is boolean does not constitute a necessary fact. This should be clear if we 
take into account that the OQL approach gives rise to a non-boolean lattice by defining different 
questions and elementary tests. In this sense, both approaches, the one presented in [21] and 
OQL, deviate from the original work of Cox, because the boolean algebra used by Cox is not 
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intended to be an empirical one, but the logic that we use as thinking creatures. This is an a 
priori condition, not a matter to be settled either operationally or empirically. 
We had already seen in Section [3] that the method of Cox consists of deriving probability and 
entropy from the symmetries of a boolean lattice, intended to represent our propositions about 
the world, while probability is interpreted as a measure of knowledge about an inference calculus. 
Once equations [12] are written, the set up for the derivation of Feynman's rules is ready. The 
path to follow now is to apply Cox's method to the symmetries defined by equations H2J But 
this cannot be done straightforwardly. In order to proceed, an important assumption has to be 
made: each proposition will be represented by a pair of real numbers. This is a strong ontological 
assumption, justified in [21] using Bohr's complementarity principle. These kind of ontological 
assumptions are susceptible to the same criticism launched against the OQL approach of Jauch 
and Piron, i.e., the fact that they are not fully operational. As we shall se below, the method 
proposed in this article is more direct and systematic, and makes the introduction of these 
indirect and involved assumptions somewhat clearer. 

Once that a pair of real numbers is assigned to any proposition, the authors of [21] reasonably 
assume that equations [12] induce operations in pairs of reals numbers. If propositions A, B, etc. 
are represented by pairs of real numbers a, b, etc., then, we should have 



a Vb 


= b V a 


(13a) 


(aVb) Vc 


= a V (b Vc) 


(13b) 


(a • b) • c 


= a(bc) 


(13c) 


(a V b) • c = 


(a • c) V (b • c) 


(13d) 


c • (a V b) = 


(c • a) V (c • b) 


(13e) 



We easily recognize in (|13p operations satisfied by the complex numbers' field. If they constituted 
the only possible instance, propositions represented by pairs of real numbers would be complex 
numbers, and thus, we could easily have Feyman's rules. However, complex numbers are not the 
only entities that satisfy [T3l There are other, and thus, extra assumptions h ave to be ma de in 
order to restrict possibilities. These additional assumptions are presented in [22J and [21J, and 
improved in [23]. We list them below. 

• Pair symmetry 

• Additivity condition 

• Symmetric bias condition 

Leaving aside the fact that these extra assumptions are more or less reasonable (justifications 
for their use arc given in [23J), it is clear that the derivation is quite indirect and artificial: the 
"experimental" logic is thus defined in order to yield algebraic rules compatible with complex 
multiplication (and the rest of the strategy is to make further assumptions in order to discard 
other fields different than complex numbers). Further, the experimental logic characterized by 
equations [12] is not the only possibility, as we have seen in Section [2] 
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In the rest of this work, we will apply Cox's method to general propositional structures 
according to the quantum logical app r oac h. We will see that this allows for a new perspective 
(including the one presented in |21j . [20] . and [22]) which sheds light onto the structure of 
non-boolean probability, and is at the same time susceptible of great generalization, living 
the door open to the derivation of alternative kinds of probabilities. 

Yet anot her i mpor tant remark is in order. As noted in the Introduction, the work presented in 
|21j . |20| . and [22] -as well as ours- is a combination of two approaches: the one which defines 
propositions in an empirical way (something which it shares with the OQL approach) and that of 
Cox. Cox's spirit was to der ive probability out of Chomsky's generative propositional structures 
ingrained in our brain [53], and this boolean structure is independent of any experimental 
information. This does not imply, though, the empirical logics needs to satisfy the same algebra 
than pervades our thinking, and that is indeed what happens. In this sense, any derivation 
involving empirical or operational logics deviates from the original intent of Cox. As we shall 
see, this is not a problem, but an important advantage. 

6 Cox's method applied to non-boolean algebras 

As seen in Section[2j operationally motivated axioms imposed on a lattice's propositional structure 
can be used to describe quantum mechanics. Disregarding the discussion about the operational 
validity of this construction, we are only interested in the fact that the embodiment is feasible. 
Similar constructions can be made for many physical systems, beyond quantum mechanics: the 
connection between the theory and experience is given by an event structure (elementary tests), 
and these events can be organized in a lattice structure in many examples of interest. 
Thus, our point of departure will be the fact that physical systems can be represented by propo- 
sitional lattices, and that these lattices need not be necessarily boolean. We will consider atomic 
orthomodular lattices. Given a system S, and its propositional lattice C, we proceed to apply Cox's 
method in order to derive an inferential calculus on C. 

6.1 Classical Mechanics 

We start with classical mechanics (that theory satisfying Hamilton's equations). Given a classical 
system Sc, the propositional structure is a boolean one, isomorphic to a perfectly boolean lattice 
used in our logical language (i.e., it is the one used by Cox). Accordingly, as shown in Section 
the corresponding probability calculus has to be the one which obeys the laws of Kolmogorov 
(we will find that it satisfies -in particular- equations [5|), and the corresponding information 
measure is Shanon's, as expected. 

6.2 Quantum case 

As shown by Birkoff and von Neumann in [23], if we follow the above path and try to define 
the propositional structure for a quantum system Sq we find an orthomodular lattice C v j^{%) 
isomorphic to the lattice of projections V(7i). What are we going to find if we apply instead 
Cox's method? It stands to reason that we would encounter a non-boolean probability measure 
with the properties postulated in Section 0] (Eqns. (0). Let us see that this is indeed the case. 
The first thing to remark is that in this derivation we assume that we have a non-boolean lattice 
C v j^{%), isomorphic to the lattice of projections V{%), and we must show that the "degree of 
implication" measure «(•••) demanded by Cox's method must satisfy Eqns. ([7]). We will only 
consider the case of prior probabilities, and this will mean that we ask for the probability that 
a certain event happens, given a state of affairs, i.e., a concrete preparation of the system under 
certain circumstances (which could be naturally or artificially produced). Thus, we are looking 
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for a function to the real numbers s such that it is non-negative and s(P) < s(Q) whenever 
P < Q. 

Under these assumptions, let us consider the operation "V". As the direct sum of subspaces is 
associative, "V" will be associative too. If P and Q are orthogonal projections (P 1 Q), then, 
we will have that P A Q = (otherwise, there would be a vector in P which is not orthogonal 
to every vector of Q). Next, we consider the relationship between s(P), s(Q), and s(PV Q). As 
P A Q = 0, it should happen that 

s(PVQ) = F(s(P),s(Q)), (14) 

with F a function to be determined. At this point, it should be clear how to proceed. Add 
now a third proposition R (notice that, for doing this, we need a space of dimension d > 3, an 
interesting analogy with Gleason's theorem), such that P _L R, Q _L R, and Q _L P (and thus 
P A R = 0, Q A R = 0, and Q A P = 0). Build now the element (FVQ)V i?. Then, because of 
this symmetry, we arrive to the following result 

s((PV Q)V R) = s(PV (QV #)), (15) 

and thus, 

F(F(s(P), s(Q)), s(R)) = F(s(P),F(s(Q), s(R))). (16) 

The algebraic propertie s of associativity for V and _L are the only prerequisite for this result. 
Thus, proceeding as in [HI [151 EQ] (and using the solutions to functional equations studied in 
[55J), we have that -up to a re-scaling: 

s((PVQ)) = s(P) + s(Q)). (17) 

For any finite family of projections Pj, 1 < j < n, we have s(Pi VP2 V ■ ■ ■ VP„) = s(-Pi) + 5(^2) + 
• • • + s(P n ). Now, as any projection P satisfies P < 1, then s(P) < s(l), and we can assume 
without loss of generality the normalization condition s(l) = 1. Thus, for any denumerable 
pairwise orthogonal infinite family of projections Pj, we have for each n 

n n 

sC\i p 3 ) = Y,< p i)< 1 - ( 18 ) 

As s(Pj) > for each j, the sequence s n — s(.\J j=i Pj) is monotone, bounded from above, and 
thus converges. We write then 

00 00 
i=i 3=1 

and we recover condition (|7dp of the axioms of quantum probability. Now, given any proposition 
C v j\f{1-L), consider P 1 - . As P V P 1 - = 1, and P is orthogonal to P^~ : we have 

s(P VP 1 ) = s(P) + s(P L ) = s(l) = 1. (20) 

In other words 

s(P L ) = l-s(P), (21) 
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which is nothing but condition (jTc|) . On the other hand, as = V and 0_L0, then s(0) = 
s(0) + s(0), and thus, s(0) = 0, which is condition ()7b[) . 

TTiis Section shows that the algebraic properties of C v j^ determine the form of the quantum 

probabilities, which on the light of this discussion, do not need to be postulated. 

Thus, we have proved that s is a probability measure on C v j\f. Is there any possibility that s 

differ from an actual quantum probability (given by a density matrix and the Born's rule)? The 

answer is no, and this is granted by Gleason's theorem, because we have proved that s satisfies 

Eqns. (J7J), and Gleason's theorem leaves no alternative (if the dimension of H > 3). 

An important question is the following: which will be the effect of non-distributivity? As we 

saw in Section [4j classical probabilities are sub-additive, i.e., they satisfy 

M(iUB) < n(A) + fx(B), (22) 

and this is linked to the stronger assertion of Eq. ([6]) (see also [5] , page 104) . 

But it is indeed the case that the analogous of Eq. (|22p does not hold for quantum probabilities. 

We show below that this derivation is susceptible of generalization. Indeed, the derivation relies 

mainly on the algebraic properties of the lattice of projections, i.e., in its non-boolean lattice 

structure. 

6.3 A Finite Non-distributive Example 

There are many systems of interest which can be represented by finite lattices. Many toy models 
serve to illustrate special features of different theories. Let us start first by analyzing L12, a non 
distributive lattice which may be considered as the union of two incompatible experiments [56] . 
An example of L12 is provided by a firefly which flies inside a room. The first experiment is to 
test if the firefly shines on the right side (r) of the room, or on the left (1), or if it does not shine 
at all (n). Other experiment consists in testing if the firefly shines at the front of the room (f), 
or o n th e back (b) of it, or if it does not shine (n). The Hasse diagram of L12 is represented in 
Fig. E3 

Applying the Cox's method to the boolean sublattices of Lyi (and suitably normalizing), we 
obtain P(l) + P{r) + P{f) + P(b) + P{n) = 2 — P{n), a quantity which may be greater than 1. 
We explicitly assert then that the exclusion-inclusion law ([6]) does not hold. This is due to the 
global non-distributivity of L\2- It is also easy to show that P(l) + P(r) = P(f) + P(b), yielding 
a non trivial relationship between atoms (something which does not occur in a boolean lattice). 

6.4 General Derivation 

Let C be an atomic orthomodular lattice. We will compute prior probabilities, i.e., we will 
assume that C represents the propositional structure of a given system -physical or not-, and 
that we want to ascertain how likely is a given event, represented by a lattice element a € C, 
assuming that the system has undergone a preparation, i.e., we assume that there exists a state 
of affairs, represented by the top element 1. In other words, we will always assume that the 
"always true proposition" represents the system in an actual preparation or state of affairs. 
We must define a function s : C — > M, such that it is always non-negative 

s(a) > OVa € C (23a) 

and is also order preserving 

a < b — > s(a) < s(b). (23b) 

We will show that under these rather general assumptions, a probability theory can be deter- 
mined. The order preserving assumption readily implies that s(a) < s(l) for all a G C. We will 
also assume that s(l) = K, a finite real number. 
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Figure 1: Hasse diagram for Zq 2 . The boolean sublattices corresponding to the two complementary 
contexts are indicated by squared lines. 



Now, as an ortholattice is complemented (Eqs. (pQ)), we will always have that —>—ia = a for all 
a € C. Accordingly, 

s(-i-, a ) = 8(a), (24) 

for all a. Next, it is reasonable to assume that s(-io) is a function of s(^a), say s(-ia) = g(s(a)). 
Thus, Eqs. ([Taj and (J24J) imply 

g(g(s(a))) = s(a), (25) 

or, in other words, 

g(g(x)) = x, (26) 

for positive x. A family of functions which satisfy (|26|) are g(x) = x and g(x) = c — x, where 
c is a real constant. We discard the first possibility because if true, we would have s(0) = 
s(-il) = g(s(l)) = s(l). But if s(0) = s(l), because of < x < 1 for all x € £, we have 
s(0) = s(x) = s(l), and our measure would be trivial. Thus, the only non-trivial option is 
s(-ia) = c — s(a) and s(l) = k. 

Now, let us see what happens with the "V" operatio n. A s C is orthocomplemented, the orthog- 
onality notion for elements is available (see Section [2 . 1 1) . If a, b £ C and a_L6, because of (llbj) . 
we have that a A b = 0. Thud, it is reasonable to assume that s(a V b) is a function of s(a) 
and s(b) only, i.e., s(a V b) = /(s(a), s(6)). By associativity of the "V", (a V b) V c = a V (6 V c) 
for any a, 6, c £ £, and this implies then that s((a V b) V c) = s(a V (6 V c)). If a, 6, and c 
are orthogonal, we will have for the left hand side s((a V b) V c) = f(f(s(a),s(b)),s(c)) and 
s(a V (6 V c)) = /(s(a), f(s(b), s(c))) for the right hand side. Thus, 
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f(f(s(a), s(b)),s(c)) = f(s(a), f(s(b), s(c))), (27) 

or, in other words 

f(f(x,y),z) = f(x,f(y,z)), (28) 



for all x, y, and z between and 1. As shown in [55], the only solution (up to re-scaling) of ([28 
is f(x, y) = x + y. We have thus shown that if aJ-b 



s(aVb) = s(a) + s(b), (29) 



and we will also have 



s(ai V a 2 • • • V a n ) = s(ai) + s(b 2 ) H h s(a n ), (30) 

whenever a%, a 2 , • • • , a n are pairwise orthogonal. Suppose now that {ajjjgN is a family of 
pairwise orthogonal elements of C. For any finite n, we have that a\ V a 2 V • • • V a n < 1, and thus 
s(ai V a 2 V • • • Va n < 1) = s(ai) + s(b 2 ) + • • • + s(a n ) < s(l) = K. Then, s n = s{a\ V a 2 V • • • Va n ) 
is a monotone sequence bounded from above, and thus it converges to a real number. As 
V{«i}«eN = limn— ^oo VT=i a «> we can write 

oo 

s(\f{ai} ieN ) = ^Js(di). (31) 
i=i 

In any orthomodular lattice we have 1_L0 (because < ->1 = 0), and 1 V = 1. Thus, 
s(l V 0) = s(l) = s(l) + s(0). Accordingly, s(0) = 0. As -.1 = 0, s(0) = c - s(l). Thus, 
s(l) = c and c = K. We will not lose generality if we assume the normalization condition 
K = l. 

The results of this section show that in any orthomodular lattice, a reasonable measure s of 
plausibility of a given event must satisfy 

For any orthogonal denumerable family {aj}j £ N 

oo 
i=l 

s(-, a ) = 1 - s(a) (32b) 

s(0) = 0. (32c) 

Why do Eqs. (|32p define non-classical (non-Kolmogorovian) probability measures? In a non- 
distributive orthomodular lattice there always exist elements a and b such that 

(a A b) V (a A -.&) < a, (33) 

so that (using (a A -"6)±(a A b), s((aA->b) V (a A b)) = s(a A~>b) + s(a A 6) < s(a). The inequality 
can be strict, as the quantum case shows. F3ut in any classical probability theory, by virtue of 
the inclusion-exclusion principle, we always have s(a A ->b) + s(a A b) = s(a). This simple fact 
shows that our measures will be non-classical in the general case. 

We give in this way an answer to the problem posed by von Neumann and discussed in Section 
[D the algebraic and logical properties of the operational event structure determine uniquely the 
general form of the probability measures which can be defined over the lattice. Accordingly, we 
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Figure 2: Schematic representation of the method proposed here. A general theory T determines via the 
von Neumann approach the algebraic structure of the set of elementary tests. Then, by applying Cox's 
method, it is possible to determine the general properties of the canonical probability theory assigned to 
T. Next, by assigning particular values to prior probabilities of atoms, all states will be determined and 
the form C, the convex set of states. The quantum mechanical (QM) and the classical (CM) case are 
shown as the extreme instances of a vast family of theories. 



did present a generalization of the Cox method to non-boolean structures, namely orthomodular 
lattices, and then we have indeed deduced a generalized probability theory. 

Remark that boolean lattices are also orthomodular. This means that our derivation is a gener- 
alization of that of Cox, and when we face a boolean structure, classical probability theory will 
be deduced exactly as in [12\ [TT] . Another important remark is that the derivation presented 
here is also valid for cr-orthocomplete orthomodular posets (see Section [2] of this work), and 
thus, of great generality. 



7 A general methodology 

At this stage it is easy to envisage how a general method can be developed. One starts first by 
identifying the algebraic structure of the elementary tests of a given theory T. They determine 
all observable quantities and in the most important examples, they are endowed with a lattice 
structure. Once the algebraic properties of the pertinent lattice are determined in this way, 
Cox's method determines the general properties of prior probabilities. Of course, it does not 
determines the particular values that these probabilities may take on the atoms of the theory. 
The specification of these values amounts to determine a particular state of the system under 
scrutiny. The whole set of these states can be rearranged to yield a convex set. The path we 
have followed here is illustrated in Fig. with the classical and quantum cases as extreme 
examples of a vast family. 

The method described in this work can be seen as a general epistemological background for a 
huge family of scientific theories. In order to have a (quantitative) scientific theory, we must be 
able to make predictions on certain events of interest. Events are regarded here as propositions 



17 



susceptible of being tested. Thus, one starts from an inferential calculus which allows for quanti- 
fying the degree of belief on the certainty of an event x, if it is known that event y has occurred. 
The crucial point is that event structures are not always organized as boolean lattices (QM being 
perhaps the most spectacular example). Thus, in order to determine the general properties of 
the probabilities of a given theory (and thus all possible states by specifying particular values 
of prior probabilities on the atoms), we must apply Cox's method to lattices more general than 
boolean ones. 

8 Conclusions 

In this work we showed that it is possible to combine Cox's method for the foundations of 
classical probability theory and the OQL approach, in order to give an alternative derivation of 
non-kolmogorovian probabilities. In particular, we demonstrated that the usual postulate which 
states that a measure should be additive for orthogonal projections can be derived using lattice 
symmetries. 

In deriving quantum probability out of the lattice properties, we have shown in a direct way how 
non-booleanity forbids the derivation of a classical probability. This sheds light on the structure 
of quantum probabilities and on their differences with the classical case. 

Using Cox's approach, Shanon's entropy can be deduced as a natural measure of information 
over the boolean algebra of classical propositions [12\ [23l I20j . We will provide a detailed study 
of what happens with orthomodular latices elsewhere. 

The strategy followed in this work suggests that we are at the gates of a great generalization. 
The general rule for constructing probabilities (and if we are lucky, entropies) would read as 
follows: 

• 1 - We start by identifying the operational logic of our physical system. The characteristics 
of this "empirical" logic depends both on physical properties of the system and on the 
election of the properties that we assume in order to study the system. This can be done 
in a standard way, and the method is provided by the OQL approach. 

• 2 - Once the operational logic is identified, the symmetries of the lattice are used to de- 
fine the properties of the "degree of implication" function, which will turn out to be the 
probability function associated to that particular logic. Remark that the same physical 
system may have different propositional structures, depending of the election of the ob- 
servers. For example, if we look at the observable "electron's" charge, we will face classical 
propositions, but if wee look at its momentum and position, we will have a non-boolean 
lattice. 

This method has not only a physical interest, but a mathematical one, namely, solving the prob- 
lem of characterizing probability measures over general lattices. A final remark: our approach 
deviates from that of Cox, in the sense that we look for an empirical logic which would be in- 
trinsic to the system under study, and because of that, not only referred to our ignorance about 
it, but to assumptions about its nature. 
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